Conference Proceedings
E-Storm: Replication-based State Management in Distributed Stream Processing Systems
Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin Rubinstein, Rajkumar Buyya
Proceedings of the International Conference on Parallel Processing | IEEE COMPUTER SOC | Published : 2017
DOI: 10.1109/ICPP.2017.66
Abstract
Apache Storm is a fault-tolerant, distributed inmemory computation system for processing large volumes of high-velocity data in real-time. As an integral part of the fault-tolerance mechanism, Storm's state management is achieved by a checkpointing framework, which commits states regularly and recovers lost states from the latest checkpoint. However, this method involves a remote data store for state preservation and access, resulting in significant overheads to the performance of error-free execution.In this paper, we propose E-Storm, a replication-based state management system that actively maintains multiple state backups on different worker nodes. We build a prototype on top of Storm by ..
View full abstractRelated Projects (1)
Grants
Awarded by Australian Research Council Future Fellowship
Funding Acknowledgements
The authors would like to thank Chenhao Qu and other members of the CLOUDS Lab at the University of Melbourne for their valuable comments towards improving the quality of the paper. This work is supported by Australian Research Council Future Fellowship and DP150103710.